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Abstract 

We examine the exclusion limits set by the CDF and DO experiments on the 
Standard Model Higgs boson mass from their searches at the Tevatron in the light of 
large theoretical uncertainties on the signal and background cross sections. We show 
that when these uncertainties are consistently taken into account, the sensitivity of 
the experiments becomes significantly lower and the currently excluded mass range 
Mh = 158-175 GeV could be entirely reopened. The necessary luminosity required 
to recover the current sensitivity is found to be a factor of two higher than the 
present one. 
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With its successful operation in the last years, the Fermilab Tevatron pp collider has 
now collected a substantial amount of integrated luminosity which allows the CDF and 
DO experiments to be sensitive to the Higgs particle, the remnant of the mechanism that 
breaks the electroweak gauge symmetry of the Standard Model (SM) and is at the origin 
of elementary particle masses [I1I2]- 

At the Tevatron, the main search channel for the SM Higgs boson is the top and 
bottom quark loop mediated gluon-gluon fusion mechanism gg ^ H with the Higgs 
boson decaying into WW pairs which lead to the clean ii/ii/ final states with i = e, fi. 
The subleading Higgs-strahlung processes qq — )■ WH, ZH add a little to the sensitivity, 
in particular at low Higgs masses. Strong constraints beyond the well established LEP 
bounds [5] have been recently set by the CDF and DO collaborations on the Higgs mass 
and the range Mh = 158-175 GeV has been excluded at the 95% confidence level (CL) 

Nevertheless, this exclusion limit relies crucially on the theoretical predictions for 
the cross sections of both the Higgs signal and the relevant SM backgrounds which, as 
is well known, are affected by significant uncertainties. In a recent study [3], it has 
been re-emphasized that this is indeed the case for the main Higgs search channel at 
the Tevatron: adding all sources of theoretical uncertainties in a consistent manner, one 
obtains an overall uncertainty of about ±40% on the gg ^ H —^iuiu signa0. This is much 
larger than the uncertainty assumed in the CDF/DO analysis, i.e. 10% for DO and 20% 
for CDF, thus casting some doubts on the resulting exclusion limit. 

In this letter, we confront the Tevatron exclusion Higgs limit with the theoretical 
uncertainties that affect the signal and background rates. We show that when they are 
included, the sensitivity of the the CDF/DO experiments is significantly lower than the 
currently quoted one. We estimate the necessary luminosity that is required to recover 
the current sensitivities and find that it should be higher than the present luminosity by 
a factor up to two. 

We begin our investigation by summarizing the impact of the theoretical uncertainties 
on the gg ^ H signal cross section which has a threefold problem. First, the perturbative 
QCD corrections to the cross section turned out to be extremely large: the i^-factor 
defined as the ratio of the higher order to the leading order (LO) [7] cross sections, is 
about a factor of two at next-to-leading order (NLO) [S] and about a factor of three 
at the next-to-next-to- leading order (NNLO) [D]. It is clear that it is this exceptionally 
large i^'-factor which allows a sensitivity to the Higgs boson at the Tevatron with the 
presently collected data. Nevertheless, the i^-factor is so large that one may question 
the reliability of the perturbative series. As a corollary, the possibility of still large higher 
order contributions beyond NNLO cannot be excluded. 

It has become customary to estimate the effects of these yet uncalculated higher order 
contributions from the variation of the cross section with the (renormalisation /i^ and 
factorisation fip) scale at which the process is evaluated. Starting from a median scale /iq 
which is taken to be /ii? = /^f = /^o = in the gg^ H process, the current convention 

-"^There are also uncertainties on the Higgs decay branching ratios, but they are very small in the 
excluded Mh range; see Ref. [6]. 



2 



is to vary these two scales within the range /io/'^ < /^/?; /^f < /^/^o with the constant factor 
chosen to be k = 2. However, as the QCD corrections are so large in the present case, it 
is wise to extend the domain of scale variation and adopt instead a value k = 3. This is 
the choice made in Ref. [S] which resulted in an (9(20%) scale uncertaintjl^ on c^^/p • 

Another problem that is specific to the gg ^ H process is that, already at LO, it 
occurs at the one-loop level with the additional complication of having to account for the 
finite mass of the loop particle. This renders the NLO calculation extremely complicated 
and the NNLO calculation a formidable task. Luckily, one can work in an effective field 
theory (EFT) approach in which the heavy loop particles are integrated out, making the 
calculation of the contributions beyond NLO possible. While this approach is justified for 
the dominant top quark contribution for Mh < 2mt, it is not valid for the fe-quark loop 
and for those involving the electroweak gauge bosons [11]. The uncertainties induced by 
the use of the EFT approach at NNLO are estimated to be of C(5%) [5]. 

A third problem is due to the presently not satisfactory determination of the parton 
distribution functions (PDFs). Indeed, in this process which is initiated by gg fusion, 
the gluon densities are poorly constrained, in particular in the high Bjorken-x regime 
which is relevant for Higgs production at the Tevatron. Furthermore, since the gg ^ H 
cross section is proportional to a1 at LO and receives large contributions at C(> a^), a 
small change of as leads to a large variation of c^^J^'^. Related to that is the significant 
difference between the world average value and the one from deep-inelastic scattering 
(DIS) data used in the PDFs [12]. 

Modern PDF sets provide a method to estimate these uncertainties by allowing a la 
(or more) excursion of the experimental data that are used to perform the global fits. 
In addition, the MSTW collaboration [13] provides a scheme that allows for a combined 
evaluation of the PDF uncertainties and the (experimental and theoretical) ones on ag- 
in Ref. [5], the combined 90% CL PDF+A^^Pa, + A*^a, uncertainty on aJJ!^)^ at the 
Tevatron, was found to be of order 15%. However, this (Hessian) method does not account 
for the theoretical assumptions that enter into the parametrization of the PDFs. A way 
to access this theoretical uncertainty is to compare the results for the central values of 
the cross section with the best-fit PDFs when using different parameterizations. 

In Fig. 1, displayed are the values of o"^^// obtained when using the gluon densities 
that are predicted by the four PDF set^ that have parameterizations at NNLO: MSTW 
[13], JR [H], ABKM [15] and HERAPDF [I6]. In the later case, two sets are provided: 
one with an as value that is close to that of MSTW and another one with the as that is 
obtained using DIS data alone. As can be seen, there is a very large spread in the four 
predictions, in particular at large Mh values where the poorly constrained gluon densities 
at high-x are involved. The largest rate is obtained with MSTW, but the cross section 
using the HERAPDF selQ with the small value is !^40% lower for Mj^ !^160 GeV. 



^See also Ref. (TU] for another reason to increase the scale uncertainty to 20%. 

■^We consider only NNLO PDFs as we make the choice of using partonic cross sections and PDFs that 
are consistently taken at the same order of perturbation theory. 

^It is often argued against the HERAPDF scheme, which uses consistently only HERA data to deter- 
mine the flavour decomposition, that it does not use any jet (Tevatron or DIS) data which is in principle 
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Figure 1: The gg ^ H cross section as a function of when the four NNLO PDF sets, 
MSTW, ABKM, JR and HERAPDF, are used. In the inserts, shown are the deviations 
with respect to the central MSTW value. 

A related issue, which is of utmost importance, is the way these various uncertain- 
ties should be combined. The CDF and DO experiments simply add in quadrature the 
uncertainties from the scale variation and the PDF uncertainties obtained through the 
Hessian method (and ignore the smaller EFT uncertainty) and they obtain an overall un- 
certainty of order 20% on the inclusive cross section. We believe (see also Ref. [TS]) that 
this procedure has no justificatioij^l- Indeed, the uncertainties associated to the PDFs in a 
given scheme should be viewed as purely theoretical uncertainties (due to the theoretical 
assumptions in the parameterization) despite of the fact that they are presented as the 
la or more departure from the central values of the data included in the PDF fits. In 
some sense, they should be equivalent to the spread that one observes when comparing 
different parameterizations of the PDFs. Thus, the PDF uncertainties should be con- 
sidered as having no statistical ground (or a flat prior in statistical language), and thus, 
combined linearly with the uncertainties from the scale variation and the EFT approach, 
which are pure theoretical errors. This is the procedure recommended, for instance, by 
the LHC Higgs cross section working group [12]. Another, almost equivalent, procedure 
has been proposed in Ref. |5]: one applies the combined PDE-a^ uncertainties directly 

important in the determination of the gluon densities. However, HERAPDF describes well not only the 
Tevatron jet data but also the W, Z data. Since this is a prediction beyond leading order, it has also the 
contributions of the gluon included. This gives an indirect test that the gluon densities arc predicted in 
a satisfactory way. See also Ref. |17j . 

^There were some responses to the addendum of Ref. [S] from CDF and DO on the tevnphwg . f nal . gov 
web site. While many comments were made on secondary and/or agreed points, the main issue (which 
explains the difference between our results) is the way to combine the scale and PDF uncertainties, and 
it was not really addressed. 
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on the maximal/minimal cross sections with respect to scale variatioijj, and then adds 
hnearly the small uncertainty from the EFT approach. This last procedure, that we have 
used here, provides an overall uncertainty that is similar (but slightly smaller) to that 
obtained with the linear sum of all uncertainties. 




Mh [GeV] 



Figure 2: The production cross section o'ggJ^n Tevatron using the MSTW PDFs, 

with the uncertainty band when all theoretical uncertainties are added as in Ref. [5] (BD). 
It is compared the uncertainties quoted by the CDF and DO experiments |1] as well as 
the uncertainty when the LHC procedure [19] is adopted. In the insert, the relative size 
of the uncertainties compared to the central value are shown. 

The overall theoretical uncertainty on ct^^h ^^^^ obtained this way, using MSTW 
PDFs, is shown in Fig. 2. In the mass range Mh ~ 160 GeV with almost the best sensi- 
tivity, one obtains a ~ +41%, —37% total uncertainty, to be compared to the ~10% and 
~20% uncertainties assumed, respectively, by the CDF and DO collaborations. We also 
show for comparison, the result obtained when one adds linearly, i.e. as recommended 
by the LHC Higgs cross section working group, the uncertainties from scale (+20%,— 17% 
on the sum of the jet cross section^Q and PDFs (+16%, -15% when the MSTW 68%CL 
PDF+A'^^Ptts error is multiplied by a factor of two following the PDF4LHC recommenda- 
tion), leading to a total of ~+36%,— 32% for Mh ^160 GeV. Thus, the uncertainty that 
we assume is comparable to the one obtained using the LHC procedure [19], the difference 
being simply due to the additional 0(5%) uncertainty from the use of the EFT approach 
that we also include. 

similar procedure has also been advocated in Ref. f^or top quark pair production. 
^An additional uncertainty of w 7.5% from jet acceptance is introduced when considering the Higgs+jet 
cross sections. We will consider it to be experimental and, when added in quadrature to others, will have 
little impact. 
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Let us stress again that the comparison between our values and those assumed by 
the experimental collaborations becomes even worse when the cross section is evaluated 
with another set of PDFs. For instance, with the HERA PDF parametrization, there is 
a reduction of ~ 40% of the normalisation compared to the central value adopted in the 
CDF/DO combined analysis. 

Thus if the ~ 20% total uncertainty assumed by the CDF collaboration is adopted, one 
can consider two scenarios. The first one is a reduction of o'^^h ~ 20% to account for 
the difference between the quadratic and (almost) linear ways of combining the individual 
uncertainties. A second scenario, would be simply to adopt the normalisation obtained 
using the HERA PDFs which gives a ~ 40% reduction of a^^j^. In both cases, the 
remaining ^ 20% uncertainty due to scale variation and the EFT will correspond to the 
overall theoretical uncertainty that has been assumed in the Tevatron analysis. 

So far, we have only addressed the issue of the signal rate. However, it is clear that 
one should equally consider the same uncertainties in the background cross sections. The 
by far largest background is pp^W~^W~ for which CDF/DO assume the inclusive cross 
section to be a = 11.34;^^3^"(scale)^2 5%(PDF) pb. We have reevaluated the rate using 
MCFM [21] and find a = 11.55+^J(scale)+^5(90%CL PDF) pb using the MSTW scheme 
(the errors due to ag are negligible here) which gives a = 11. 55^^^^ pb if the errors are 
added according to Ref. [5]. In fact, if we adopt the ABKM or HERAPDF sets, we would 
obtain a rate of, respectively, 12.35 pb and 11.81 pb. i.e ~ 9% higher in the maximal 
case. We will thus consider that a{pp — )■ VF"*"!^") can be ~ 10% larger/lower than 
assumed by CDF/D(i and we will consider a third scenario in which the normalization 
of the pp^WW background is changed by ±10%. 

Let us now come to the discussion of the Higgs Tevatron exclusion limit in the light of 
these theoretical uncertainties. We will base our exploration on the CDF study published 
in Ref. [22] which provides us with all the necessary details. In the analysis of the gg — )■ 
H — 7- WW — iiuu signal, the production cross section has been broken into the three 
pieces which yield different final state signal topologies, namely iiuu+Oiet, iiuu+l^et 
and ££z/z/+2jets or more. These channels which represent, respectively, ~ 60%, ~ 30% 
and ~ 10% of the total ct^^h [1H], have been studied separately. In the iiuu+Ojet and 
+ljet samples, two configurations have been analyzed, one with a high and one with a 
low signal over background ratio (depending on the quality of the lepton identification). 
In addition, a sample with a low invariant mass for the two leptons, M« < 16 GeV, has 
been included. Five additional channels resulting from the contributions of the Higgs- 
strahlung processes are also included: pp VH — )■ VWW leading to same sign dilepton 
and to trilepton final states. These channels give rather small signal rates, though. 

Our main goal is to estimate the necessary relative variation of the integrated luminos- 



® We have also evaluated a{pp—>-tt) under the same assumptions as [1] but with = 173.3±l.lGeV 
and find a{pp it) 7.07+g ^^^(scale)+g°o|f"(PDF + A'=''P+"^a^)±3.3%(Amt) pb, which leads using the 

procedure of [5] to a total uncertainty of Aa/a =t.i4e%i i-e niuch larger than the one assumed by CDF 
and DO. In the case of the Drell-Yan process, there is also a « 10% excess in the rate if one uses 
HERAPDF instead of the MSTW set: af^/' = 7.6 nb versus af^^ = 7 nb [17]. 



6 



ity needed to reproduce the currently quoted sensitivity of the CDF collaboration, if the 
normalization of the Higgs signal cross section (as well as the corresponding backgrounds) 
is different from the one assumed to obtain the results. 

Our approach consists of the following. First, we try to reproduce as closely as possible 
the CDF results using the information given in Ref. [22] for a mass Mh = 160 GeV, for 
which the sensitivity is almost the best (we will assume that the results are similar in the 
entire excluded mass range 158-175 GeV). Then, we consider the two scenarios 

discussed previously which, in practice, reduce the normalisation of the Higgs production 
cross section by ~ 20% (when all the uncertainties are added using the procedure of 
Ref. |5]) and ~ 40% (when the HERAPDF set is used to derive the central value of 
the cross section). We estimate the relative variation of the sensitivity and increase the 
integrated luminosity until we recover our initial sensitivity. Finally, we assume that 
the obtained relative variations of the sensitivity as well as the required luminosity to 
reproduce the initial sensitivity, would be the same for the CDF experiment. 

A naive attempt to reproduce the CDF results [22] was to use the background, signal 
and data numbers for all the search channels of Tables I-VIII without including the 
neural-network information or any treatment that uses shape information. This naive 
approach resulted in a sensitivity (95% CL/ctsm) ~ 12 times weaker than the CDF one. 
This large difference made us feel uncomfortable, as we would have needed to make the 
above assumptions over one order of magnitude difference for the sensitivity (or two orders 
of magnitude for the resulting necessary luminosity) compared to the CDF analysi^. 

To be as close as possible to the CDF analysis and results [22], we considered their 
neural network outputs for the 10 search channels (each one for the signals, backgrounds 
and data) presented in Figs. 2,4,- ■ -,16 to build the background only and the background 
plus signal hypotheses, implemented them in the program MClimit [23] and used a ratio 
of log-likelihood "a la LEP" as a test-statistic for which we combined the above channels; 
this provided the 95%CL/(Tsm sensitivity limit on the Higgs boson at the considered mass 
of Mh = 160 GeV. A median expected 95%CL/(Jsm limit of 50 = 1.35 has been obtained, 
to be compared to 5*0 = 1.05 in the CDF analysis; for the observed 95%CL/crsM limit, the 
agreement is better as we obtain 1.35 compared to 1.32. We feel thus satisfied with this 
rather close result as even the CDF and DO collaborations agree in their methods within 
only 10% accuracy for the same input Monte Carlo and data [21]. We therefore believe 
that we can safely adopt the three working hypotheses described above. 

We consider the first two scenarios in which the gg ^ H ^ WW — iiuu signal cross 
section has been reduced by 20% and 40%. In each case, the expected signals and the 
corresponding backgrounds at the Tevatron have been multiplied by a luminosity factor 
that has been varied. For each value of the luminosity factor, the corresponding median 
expected 95%CL/ctsm has been estimated and normalized to the initial sensitivity Sq = 
1.35 obtained above. The results are reported in Fig. [3] where the Tevatron luminosity 
is shown as a function of the obtained normalised sensitivity. The luminosity needed to 

^This factor w 12 gain in sensitivity obtained using neural network techniques (including spin- 
correlations, the main discriminant), is to be compared with the modest gain of w 30-50% envisaged 
by the LHC experiments. It turned, though, that the CDF cut-based analysis was not fully optimised. 
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Figure 3: The luminosity needed by the CDF experiment to recover the current sensitivity 
(with 5.9 fb^^ data) when the gg ^ H ^ iiuu signal rate is lowered by 20 and 40% and 
with a ±10% change in the pp^WW dominant background. 

recover the current Sq CDF sensitivity is given by the intersection of the vertical (blue) 
line with the luminosity curves. One sees that if ct^^h lowered by 20%, a luminosity 
of ~ 8 fb~^, compared to 5.9 fb~^ used in [22] would be required for the same analysis 
to obtain the current sensitivity. If the rate is lower by 40%, the required luminosity 
should increase to ~ 13 fb~^, i.e. more than a factor of two, to obtain the present CDF 
sensitivity. 

As an additional exercise, we also analyzed the impact of changing the normalization 
of the background cross sections by ±10%, as in our third scenario, simultaneously with 
lowering the signal by 20 and 40%. One sees that increasing/decreasing the background 
will degrade/improve the sensitivity and a ^ 10% higher/lower luminosity would be re- 
quired to recover the sensitivity. 

We conclude by noting that the reduction of the signal by 40% as would be the case 
if the HERAPDFs were used for its normalization, would reopen the entire mass range 
Mh = 158-175 GeV excluded by the CDF/DO analysis with 12.6 fb"^ combined data. 
Hence, we face the uncomfortable situation in which the Higgs exclusion limit depends 
on the considered PDF. 

Acknowledgements: We thank G. Altarelli, M. Chen, A. Cooper-Sarkar, M. Dittmar, 
A. Korytov, H. Prosper, G. Salam, M. Spira, P. Verdier for discussions. We acknowledge 
the projects SR/S2/JCB64 DST (India) and ANR CPV-LFV-LHC NT09-508531 (FR). 



^''The correlation between signal and background is implicitly taken into account as we use the results 
of [22]; we assume though that it is almost the same when another PDF set is adopted for both signal 
and background. 



8 



Erratum 



After our paper had appeared in Physics Letters B, we reahsed that an error occurred in 
the numerical analysis which had led to Fig. [1] for the gg ^ H production cross section 
when the four NNLO PDF sets are adopted. In the plot with the two HERAPDF sets, 
the central scales at which (J^^h has been evaluated were not set to fiR = fip = \Mh 
as it should have been, but at fiR = fip = which gives the minimal cross section 

once the scale uncertainty is included. This explains the large difference in the cross 
sectioij"]. up to 40%, between the MSTW and HERAPDF predictions. We thus present 
our mea culpa and produce in Fig. H] the correct figure where all scales are consistently 
set to fiR = fip = \Mh- The difference between the MSTW and HERAPDF predictions 
reduces now to ~ 20% at most, which is indeed much more reasonable. In this case, 
the smallest value of the cross section is given when using the ABKM set and amounts 
to ~ 20%-30% in the considered Higgs mass range as noticed in Ref. [5] (this difference 
is slightly larger if the new ABMIO PDF set is used [E]). Note that the same analysis 
presented for the LHC in Ref. [6] is not affected by this problem. 
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Figure 4: The gg ^ H cross section as a function of Mh when the four NNLO PDF sets, 
MSTW, ABKM, JR and HERAPDF, are used. In the inserts, shown are the deviations 
with respect to the central MSTW value. 

This error does not affect the subsequent discussion and almost does not change our 
conclusions. Indeed, the main analysis which led to Fig. 2 (which, we believe, is the most 
important result of our paper) is still valid as we estimate the PDF uncertainties within 
the MSTW set and our conclusion, that the theoretical uncertainty on the gg ^ H cross 
section at the Tevatron is ~ 40%, still holds true. 

^^We thank Graham Watt for pointing out to us that his calculation of the gg — ?> H cross section with 
HERAPDF does not lead to such a large difference. 
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Nevertheless, the interpretation of the CDF/DO limit when lowering the normalisation 
of the cross section, has to be modified. Instead of lowering the normalisation by 40%, one 
has to lower it by 30% which is the difference betweeen the MSTW and ABKM predictions. 
The luminosity needed by the CDF experiment to recover the present sensitivity is shown 
in Fig. [5] in this case. With this normalisation and including the 10% uncertainty on the 
backgroud rate, the needed luminosity to recover the present sensitivity will be slightly 
less than a factor of two. 




Figure 5: The luminosity needed by the CDF experiment to recover the current sensitivity 
(with 5.9 fb~^ data) when the gg ^ H ^ iiuu signal rate is lowered by 20 and 30% and 
with a ±10% change in the pp^WW dominant background. 

Note, however, that the updated results given by the CDF/DO experiments for the 
winter 2011 conferences with a luminosity of 7.1 fb~^ for CDF, lead to an exclusion limit 
that is slightly worse than the one quoted here and only the range Mh = 158-173 GeV 
is excluded. Thus, even for a 30% reduction of the production cross section only instead 
of the 40% used earlier, one still needs ~ 13 fb~^ data to recover the sensitivity obtained 
with 7.1 fb-^ 
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